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Abstract — This article deals with localization probability in a 
network of randomly distributed communication nodes contained 
in a bounded domain. A fraction of the nodes denoted as L- 
nodes are assumed to have localization information while the 
rest of the nodes denoted as NL nodes do not. The basic model 
assumes each node has a certain radio coverage within which 
it can make relative distance measurements. We model both 
the case radio coverage is fixed and the case radio coverage 
is determined by signal strength measurements in a Log-Normal 
Shadowing environment. We apply the probabilistic method to 
determine the probability of NL-node localization as a function 
of the coverage area to domain area ratio and the density of 
L-nodes. We establish analytical expressions for this probability 
and the transition thresholds with respect to key parameters 
whereby marked change in the probability behavior is observed. 
The theoretical results presented in the article are supported by 
simulations. 

Index Terms — Ad-hoc network, connectivity, GPS, iterative 
localization, LBS, localization, location based services, position- 
ing, probabilistic method, random arrays, range-free localization, 
sensor networks. 



I. Introduction 

This article deals with a network of randomly distributed 
nodes some of which have a-priori knowledge about their 
position (these are the so called L-nodes) while others are 
supposed to localize themselves (these are the so called non- 
localized or NL-nodes). 

This general framework can be recognized in many practical 
scenarios and in different types of communication networks, 
such as Distributed Sensor Networks (DSNs) and wireless 
networks. Generally speaking, we may classify the localization 
algorithms in at least two ways, centralized or distributed [1] 
and range-free or based on ranging techniques [2]. The most 
common techniques are based on measured range whereby 
the location of nodes are estimated through some standard 
methods such as tri angulation. The ranging technique itself 
can be based on: 1) Received Signal Strength (RSS) mea- 
surements, 2) Time Of Arrival (TOA) measurements requiring 
clock synchronization among nodes, 3) Time Difference Of 
Arrival (TDOA) techniques requiring relative synchronicity of 
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L-nodes, 4) Angle Of Arrival (AOA) measurements, and 4) 
hybrid combination of the above. 

In range-free localization, connectivity between nodes is a 
binary event, either two nodes are within communication range 
of each other or they are not [3]. For simplicity, we may view 
this as a hard quantization of for instance RSS. If RSS is above 
a certain detection threshold, the nodes can communicate, oth- 
erwise they cannot. Of course, the nature of path loss and the 
terrain characteristics influences both the coverage radius and 
the deviation of the coverage zone from the idealistic circular 
geometry. Various range free algorithms have been proposed 
in the literature including the centroid algorithm [4], the DV- 
HOP algorithm [5], the Amorphous positioning algorithm [6], 
APIT [7], and ROCRSSl [2]. The fundamental characteristic 
of these algorithms is their ability to provide coarse (compared 
to ranging techniques) localization information with minimal 
per node communication and computation requirements. 

Regardless of whether ranging based or range-free tech- 
niques are used for localization, several global issues of con- 
cern remain to be addressed in their application. In particular, 
this article deals with broader questions of whether a given 
NL-node can localize itself in a random array of nodes, a 
percentage of which have localization information. What are 
the requirements on the density of nodes over a terrain and 
coverage area of individual nodes that would allow node 
localization with very high probability? 

These questions and others like them are at the heart of 
the localization problem and by their very nature, require a 
probabilistic setting [8]. To have a frame of reference, we 
present a communication scenario which is needed before 
any localization of NL-nodes can take place. In particular, 
we assume: 1) ranging techniques are employed and the 
coverage radius of each node is dependent on the path loss 
characteristic. While the coverage radius may vary from node 
to node, we assume that the coverage region is circular The 
fact that the coverage geometry may not be perfectly circular 
has no significant bearing on the calculated probabilities. This 
is because of our assumption of uniform distribution of nodes 
on the terrain whereby what matters in so far as probability of 
finding other nodes in a given area around a reference node 
is concerned, is the area itself and not its geometric shape; 2) 
the nodes have calibrated power levels possibly achieved via 
an external beacon signal, 3) a CSMA/CA link layer protocol 
is employed for communication among nodes, hence, while 
collisions might occur, we can assume that the nodes within 
communication range can communicate without difficulty, and 
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4) periodic broadcasts by L-nodes are used to inform NL- 
nodes that can hear them of the coordinate location of the 
L-nodes relative to some absolute reference frame. 

A variety of locaUzation techniques for both indoor and 
outdoor envirormients are currently available, offering various 
trade-offs between accuracy, cost and complexity. In this 
article we assume that each NL-node needs to communicate 
with at least three other L-nodes in order to be able to localize 
itself. A review of various localization techniques proposed 
in the literature may be found in [9]. In [10], the authors 
propose an approach based on cormectivity information, to 
derive the locations of nodes in a network. In [1 IJ, the authors 
present some work in the field of source locaUzation in sensor 
networks. 

The rest of the article is organized as follows. In section- 
II we present the theoretical analysis for the localization 
probabihty. Section-Ill is devoted to presenting the simulation 
results in support of the theoretical analysis and section-IV to 
conclusions. 

II. Analysis of Localization Probability Using 
THE Probabilistic Method 

The basic parameter set defining the problem is as follows. 

• Total number of nodes distributed uniformly over a 
circular disk of radius R (denoted as the domain) is n. 

• fc of the n nodes are assumed to be L-nodes (imply- 
ing they have localization information relative to some 
coordinate frame. How this localization is established is 
irrelevant to problem formulation), the rest are denoted 
as non-localized nodes (NL-nodes) and need to locaUze 
themselves. 

• The radio coverage radius is denoted as d and is defined 
hy d = ^/Acov/t^ (we assume d « R). In particular, 
the shape of the coverage area Acov is irrelevant in so far 
as the calculation of the probabilities is concerned due to 
the assumption of the uniform distribution of the nodes 
in the terrain. 

• The localization problem is two dimensional and three 
distance measurements relative to nodes with known 
positions is sufficient to solve for the {X, Y) coordinates 
of the NL-node unambiguously. 

• We shall neglect the boundary problem in the sense that 
the nodes near the boundary of the domain can only 
see other network elements within the domain. Hence 
in effect, their radio coverage area within which they 
may identify other nodes is reduced. This assumption 
is vahdated to be reasonably good, even when the total 
number of nodes is not very large. 

Various locaHzation techniques have been reported in the 
Uterature. The basic strategy assumed in this article is as 
follows L12J. 

1) The graph corresponding to the set of nodes is created 
with bidirectional arcs based on range. In particular, any 
pair of nodes whose true distances are within a certain 
Umit are cormected by a bi-directional edge and can 
communicate with each other. 



2) A given NL-node that has not localized itself, stores 
two sets of information generated locally: i) absolute 
coordinates of the L-nodes within its radio coverage (if 
any), ii) distance estimates to all other nodes within its 
radio coverage based on mean power measurements. We 
shall assume the measured distance information to be 
accurate. 

The basic question that we wish to answer in this article is 
as follows; given that we are at a NL-node that is interior 
to our domain, what is the probabiUty that the node can 
localize itself? To localize itself a NL-node needs at least 
three L-nodes within its radio coverage. Given the existence 
of degenerate cases, the formulas presented in the Theorems 
below really represent lower bounds to the node localization 
failure probability. However, since the degenerate cases are 
very rare, the calculated probabilities are very close to the 
true values (i.e., the bound is tight), and we have verified this 
experimentally. 

Theorem 2.1: Under the assumption of a uniform distribution 
of n nodes, k < n of which are L-nodes while the rest are 
NL-nodes, over a circular domain of radius R, and per node 

radio coverage radius of d < R, the NL-node localization 
failure probability is tightly lower bounded by: 

Pf > {(i^|^ + (l-a)^ + l}o{[fe^a+(l-6^)r-i} 

= [1 - (1 - a) • 6^]"^^ ■ [l + h\l - a){n - 3)+ 
+b^(l-a)^ <"-^»"-^h 

(1) 

where a — {1 — ^) (i.e., the fraction of the NL-nodes), 
6 = (■^) and the operator notation of vector calculus is used 
to simplify the expression. 

Proof. The assumption of uniform distribution of nodes over 
our domain implies that the probability of having a node within 
a circular disk (i.e., the radio coverage zone of a given node) 
of radius d is given by ^ ^^""^ ^ = (■^)^. In p independent 
trials, the probability of having p nodes within the coverage 
zone is simply (■^) ^ . ProbabiUty of a NL-node localization 
failure is bounded by: 

Pf > P{p <2 or {p>3 and at most two nodes 
are L-nodes while the rest are NL-nodes)} 

The probabiUty that p = 0, is the probability that all other 
(n — 1) nodes are outside the coverage zone of the NL-node 
and is given by [1 — 6^]""^. Similarly, the probability that 
p>l nodes are within the coverage radius is: 

If in addition, we require that of the p nodes at most two be 
L-nodes while the other (p — 2) nodes are NL-nodes, then the 
probabiUty of the event of interest is: 

( "p^ ^ ^,2p[i _ ^2j„-i-p + paP-^1 - a)+ 

+£^^of-2(l-a)2" 
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Fig. 1. Localization probability as a function of a for various values of b 
for a network containing n = 300 nodes. The leftmost curve is associated 
with the smallest b {b = 0.099), the rightmost curve is associated with the 
largest b (b = 0.878), with b varying as b{j) = 2^^^^ - 1, j = 3,4, ...,20. 
Increasing b monotonically increases the locahzation probability for a given a. 



Putting the pieces together, after some algebra the NL-node 
locahzation failure probability can be written as (assuming 
n > 4): 



Pf > 



n-1 

+paP-'{l -a) + - af 

This expression can further be written in the form: 

(l-a)2 92 



(3) 



Pf 



> 



d 



{[62. a +(1-62)]"-!} + 
} + 



□ 

When (1 



oa ^ 

6^ << 2/n we have the approximation: 



Pf 



1- [(n-3)-(l-a)-62 



(4) 



(5) 



Fig. [T] depicts the localization probability (1 — Pp) as a 
function of the fraction of the NL-nodes, a, for exponentially 
increasing values of h defined via the expression b{j) = 
2^/22 _ 1^ j = 3^4^ ...,20 in a network containing n = 300 
nodes. It is evident from the figure that the localization proba- 
bility increases as the fraction of the NL-nodes decreases for a 
fixed h, and for a given fraction of NL-nodes, the localization 
probability increases rapidly as a function of increasing h. 

A. Extension to Log-Normal Shadowing 

Our analysis above on the node localization failure probabil- 
ity may really be taken to represent a probability conditioned 
on two things: 1) the total number of nodes n is known, 
and 2) the radio coverage area of the NL-node is known. In 
this subsection, we wish to demonstrate that it is relatively 
straightforward to remove these conditions. In particular, we 
shall take a particular propagation model and remove the 



conditioning on the knowledge of the coverage area. To this 
end, consider the problem of localizing a specific node by 
signal power measurements. Let us consider the received 
power P{d) at a distance d from a specific point P{d) = 
Po — lOrip log]^Q(d) + Xs whereby Po is the signal power at 
a reference distance do normalized to one for simplicity, Up 
is the path loss exponent, and Xs is a Gaussian-distributed 
random variable taking into account the shadowing effect, 
i.e, Xs ^ N{iJLs,<j'l) with /^^ = 0. An estimator of d is 

given by d = 10 "'"p , where P is the measured power. 
With simple mathematical calculation, it is possible to obtain: 
io'°*^i« ^ . 10" 
' . The random variable Xi 



d = 10 lo-j 
d/d = 10" 



i""p and then 
= Xs/up is 

a gaussian random variable A^(0, af) with zero mean and 
variance — al/n^. We take d to be the true distance 
that would be measured if Xs = 0. Hence, Xs behaves as a 
perturbation affecting the measured distance d. Let the receiver 
have a detection threshold of 7. Hence, if the received power 
is below 7, the receiver does not detect the presence of a 
signal. This puts an upper limit on the estimated distance as 
dmax = 10 ^""p , leading to an upper limit of ^ < 222^, We 
note that if the received power is below 7, the transmit and 
receive nodes do not see each other. Hence the probability of 
finding a node in the coverage zone of the transmit node is 
zero. Since our probability values are proportional to the size 
of the coverage area, we interpret this as the event that d ~ 0. 

With this setup, the ratio between the estimated distance 
(when measurable) and actual one d, i.e., Y = d/d is a random 
variable with log-normal probability density function fviy) 
(initially we assume 7 = —00 dB). In fact, considering the 
following transformation applied to the random variable Xi ^ 



N{0,afy,Y ^ g{Xi) 
distributed as follows: 



10- 



it is simple to see that ^ is 



fviy) 



\9'ixi)\ 



TT 



(6) 



10 

In(lO)' 



/xi(xi) ^ NiO,a!), and \g ix,)\ 



whereby a 
^10-S. 

An important parameter in the analysis that follows is the 
coverage to domain radius ratio. Consider the random variable: 
-^ = 6= (^2) ■ {■^) whereby 60 = is a fixed constant 
representing the true coverage to domain radius ratio. With 
this setup, we have d/ R = b = bo ■ Y. Define: 



510(X)-I01ogio(6o))^ 
^ 



The probability density function (pdf) fr;{b) of b is: 



(^101ogio(!')-H)-l°Slo(''o)j 
^ 



+ {lZ^hi^)dx]S{b), 
I 



< 6 < brnax 

b > b^jidx 



(7) 

Note that 5 is a mixed random variable. The probability 
P{b = 0) 7^ 0. This follows from the argument above that 
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the event that the received signal is below the detection 
threshold is equivalent to the event that d — Q implying 
6 = 0. Again, considering the detection threshold, we have 
the upper limit b < dmax/R- To proceed further, we make 
the basic assumption that < 1 (this is almost always 

true) and that the coverage region is entirely contained in 
our domain. Note that the detection threshold 7 and path 
loss exponent have a major impact on dmax, but, we always 
assume R is large enough that the above condition holds. 

Theorem 2.2: Under the assumption of a uniform distribution 
of n nodes, k < n of which are L-nodes while the rest are NL- 
nodes, over a circular domain of radius R, and per node radio 
coverage governed by log-normal shadowing with < 1, 

the NL-node localization failure probability Pp is tightly lower 
bounded by: 



E 



n — 3 
I 



+ k2E 



(8) 

where a = (1 ^ (^■^■> ^'^^ fraction of the NL-nodes), 
= l-a, ka = (l-o)-(n-3), andk^ = (1-a)^- 

Proof. From Theorem 2.1 the NL-node localization failure 
probability conditioned on 6 can be written as follows: 



l-ki-1? 
Using the fact that: 



n— 3 



l-ki-V 



n— 3 



n-3 

E 

i=0 



1 + k2b' 



n — 3 
I 



-kib' 



it is possible to obtain the following relation: 

n — 3 



E 

(=0 



n — 3 



(-fci) 



(9) 

With this setup, the NL-node localization probability failure 
Pp can be lower bounded as follows: 



Pf = 



P 



F\b 



Mb)db. 



By substituting (|9]l in the previous equation, it is possible to 
obtain: 

jro) 

Note that the contribution of the delta function of f-^{b) in 
above integrals is zero since we are looking at non-central 
moments. This way a closed form bound for Pp can be written 
as follows: 



Pf > Er=o' 



'koE 



n — 3 
I 



(-fcl)' { 



E 



k2l 



k.E 



521+4 



(11) 
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Coverage to Domain Radius Ratio in Absence of Shadowing 



Fig. 2. Localization probability as a function of bo for a = 0.2 under both 
Log-Normal shadowing (solid line) and no shadowing (dashed line). 



□ 

Under certain conditions the expectations above are approxi- 
mated by the moments of a log-normal random variable b. The 
k-th order moment of a log-normal random variable is given 

by, 

iJ [/] = e^^+K^) (12) 

whereby ii is the mean of the random variable 10 logj^Q y. In 
particular, provided that the number of nodes n is small (less 
than 10) and the variance of shadow variable Xg is sufficiently 
small and recalHng that a — -^^j^ and fi = 10 ■ \ogiQ{bo), we 
have the approximation: 



7-) v^n— 3 1 TL 3 

Pf - E/=o 



21 , .l(2l\-'2 



21 + 2 I l/2i + 2\2 2 21+A . l/2Z + 4\2 2 



(13) 



The above expression can be written in the alternate form: 



Pf - EL-o^("7')(-fci/ 

,(2i+2)^li<!il!f 



+ fe20o e 37. T2 + /C3O0 e 37.72 

^(14) 

Note that the limiting behavior is consistent in that if Xs has 
variance of zero, cti = and the above expression reduces to 
the already known result as expressed in 

For a numerical example, consider a scenario with the 
following parameters: 7 = —80 dBm, Pq = dBm, do = 
10 cm, rip = 3.5, 12 dB, cji = 3.43 dB, i? = 40 m, 

n = 50 and k — 10. The computed dmax — 19.3 m 
and bmax = 0.48. Fig. [2] depicts the localization probability 
(1 — Pp) obtained using numerical integration for computation 
of the Expectations above, as a function of bo- For comparison 
purposes, we also show the results obtained from Theorem 2.1 
where there is no Log-Normal shadowing. Two immediate ob- 
servations from the figure are: 1) shadowing actually improves 
localization probability when bo is small, and 2) shadowing is 
detrimental to localization probability when bo is relatively 
large. 
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B. Threshold Conditions 



Returning to our analysis where we assume the knowledge 
of the radio coverage area of a given NL-node, a common 
characteristic of many problems tackled using the probabilis- 
tic method is the existence of a transition threshold where 
the characteristic of interest exhibits a large variation. The 
transition threshold observable specially for large values of b 
can be obtained by taking the second partial derivative of Pp 
with respect to a and setting the result to zero. Given that Pp 
is defined through a differential equation, it is better to work 
directly with this form. This leads to the condition: 

(i-g) 0^ _ 

(15) 



2 da' 



■ a 



-(1-62)]"-!} 



The resulting closed form expression for the transition thresh- 
old on a is given by: 

1 



1 



62(0.5n- 1) 



(16) 



At the value a*, there is a marked change in the localization 
probability. Values of a below threshold lead to a high prob- 
ability of localization of the NL-nodes. Values of a greater 
than a* lead to very low probability of one shot localization. 
The transition threshold is plotted in Fig. [3] as a function of 
the coverage to domain radius ratio. The noticeable feature is 
the rapid increase in this threshold at a value of 6 ~ 0.15. 

Fig. m depicts the localization probability (1 — Pp) as 
a function of the coverage to domain radius ratio, b, for 
exponentially increasing values of a defined via the expression 
a{j) ~ 2-'/^^ — 1, j = 1,2, ...,20 in a network containing 
n — 300 nodes. 

The transition threshold in this scenario can be obtained by 
taking the second partial derivative of Pp with respect to b 
and setting the result to zero. This leads to the closed form 
expression: 



2(l-a)(2n;*-8n^ + 10.5n-4.5) 



1 + 



1 



6(n-9)(2n3-8rt^+10.5K-4.5) 



1/2 



(17) 



(4n2-n-15)2 

For large values of n, the above expression simplifies to: 



1 



(1 — a)n 



1 + VTJb 



1/2 



(18) 



For a fixed a, there is a transition threshold with respect to b. 
At values of b below 6*, the localization probability is small 
while for values of b greater than b* the localization probability 
increases rapidly. The transition threshold is plotted in Fig.|5]as 
a function of the percentage of the NL-nodes. The noticeable 
feature is the relatively gradual increase in this threshold as a 
function of a. 

So far we have looked at what we may call a one-shot local- 
ization of the NL-nodes. What we mean is that our calculated 
localization probability looks at the event that a given NL- 
node finds at least three L-nodes within its radio coverage. 
Once a NL-node localizes itself, technically, it could become 
a reference localization node and change from the category of 
NL-node, to the category of L-node. The process of NL-node 
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Coverage to domain radius ratio, b 

Fig. 3. Probability transition threshold a* as a function of b for a network 
containing n = 300 nodes. 



localization can then repeat itself with the potentially newly 
added L-nodes. The study of this iterative localization process 
is beyond the scope of the present work. The asymptotic 
behavior however in the limiting case of infinite iterations is 
relatively easy to deduce. In particular, there is always a non- 
zero probability that at least one node may not get localized 
even if all the other nodes are L nodes. This provides the lower 
bound: 



Pp(oo) > 



2 

E 

k=0 



n- 1 
k 



b'^[l-b'] 



2in-l-fc 



III. Simulation Results 

We have simulated the node localization problem as outlined 
in the previous section. The details of our simulation setup is 
as follows. 

• We consider uniformly distributing n nodes over a do- 
main of radius R normalized to i? = 1. To achieve this, 
we generate n pairs of uniformly distributed independent 
random variables (r^, 6i). Random variable n is uninform 
in the interval [0, 1] while 9i is uniform in the interval 
[— 7r,7r]. To get a uniform distribution of nodes over the 
domain, the variable r; is transformed to such that 
the pair (y^, 9i) denotes the polar coordinates of the i-th 
node within our domain. 

• fc of the n nodes are identified as the L-nodes by 
selecting at random a set of k unique indices in the range 
{l,2,...,n}. 

• For a given NL-node, the set of distances to the k L-nodes 
are calculated and the number of L-nodes (if any) that fall 
within the coverage radius d is obtained. This is repeated 
for all the NL-nodes and the count of the total number 
of NL-nodes that can localize themselves is obtained. 

• To get good statistical averages, we have generated 1000 
realizations of the localization problem for a given set 
of parameters, n, k, a, b. We have then obtained the 
empirical estimate of node localization failure probability 
by averaging the results over 1000 reaUzations. 
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Fig. 4. Localization probability as a function of b for various values of a in 
the set a{j) = 2^/^^ ~ li i = li 2, 20 for a network containing n = 300 
nodes. The leftmost curve is associated with the smallest a. Increasing a 
monotonically decreases the localization probability for a given b. 

0.4 1 , , , , , , , , , 1 



0.35 - 



0.3 - 
o 0.25 - 
■| 0.2 - 
0.15 - 

0.1 - 




0.05 1 ^ ^ ' ^ ' ^ ^ ^ ^ 1 

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Peroentage of NL nodes, a 

Fig. 5. Probability transition threshold b* as a function of a for a network 
containing n = 300 nodes. 

As an example of sample simulation results while assessing 
the impact of the density of nodes over the domain. Fig. |6] 
depicts the NL-node localization probability versus a for three 
different values of n at a value of 6 = 0.05. Simulation points 
are marked by "*" while the analytical curves obtained using 
the formulas derived in the previous section are shown as solid 
lines. Note that at n = 500 the curve is convex, at n = 1000 
it is almost linear and at n = 3000 it is almost concave. 
The other noticeable effect is the reduction in the localization 
probabiUty for a fixed a as the density of the nodes within the 
domain is reduced. The gap between theory and simulation can 
be attributed to the boundary points which as stated previously, 
have an effectively smaller coverage area than the interior 
points. 

IV. Conclusions 

In this article we have presented a probabilistic setup for 
NL-node locaUzation in a randomly distributed network of 



Fig. 6. Simulated and theoretical (1 — Pp) as a function of percentage of 
NL-nodes a (simulated points are marked by "*") for three different values 
of n at fe = 0.05. 

mixed localized and non-localized nodes. The probabilistic 
method is then used to answer some fundamental questions 
regarding the feasibility of NL-node localization in such a 
network based on some basic parameters such as the density 
of the nodes and coverage to domain radius ratios. We have 
derived expressions for transition thresholds with respect to 
several key parameters whereby marked change in the local- 
ization probability is observed. 
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